Journal article
A Multi-pass Sieve for Clinical Concept Normalization
Yuxia Wang, Brian Hur, Cornelia Verspoor, Tim Baldwin
Traitement Automatique des Langues (TAL) | Association pour le Traitement Automatique des Langues (ATALA) | Published : 2020
Abstract
Clinical concept normalization involves linking entity mentions in clinical narratives to their corresponding concepts in standardized medical terminologies. It can be used to determine the specific meaning of a mention, facilitating effective use and exchange of clinical information, and to support semantic cross-compatibility of texts. We present a rule-based multipass sieve approach incorporating both exact and approximate matching based on dictionaries, and experiment with back-translation as a means of data augmentation. The dictionaries are built from the UMLS Metathesaurus as well as MCN corpus training data. Additionally, we train a multi-class baseline based on BERT. Our multi-pass ..
View full abstractGrants
Funding Acknowledgements
This work was supported by China Scholarship Council (CSC) and the University of Melbourne.We are grateful to the anonymous reviewers for their insightful comments.